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ABSTRACT 



The present invention relates to a system and method for 
canceling echoes from telecommunications networks by 
providing a multi-adaptation echo canceller. Unlike conven- 
tional TDM echo cancellers, th e present invention uses at 
least two ech o cancellers ar ranged in a novel manner to 
cancel ecnoes generated bv a voice p a cket associ ated with 
an IP network or the lik e. The present invention further 
^provides a double adaptation echo canceller that transfers 
coe fficients from a first echo canceller to a second ec ho 
canceller, ine pres ent system compares the single and 



double adaptation echo error signals and 
canceller witn the smaller error signal 1 



selects the echo 

This process is 

continuously repeated so that the systcfn can select the 



o ptimal echo cancelle r for canceling echoes tor cacn voice 
block/packet. The present invention can be implemented 
with any number of echo cancellers so long as the each block 
or packet can be processed within a specified time frame. 

24 Claims, 7 Drawing Sheets 
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MULTI-ADAPTATION FOR A VOICE 
PACKET BASED 

This application claims benefit of provisional application 
Ser. No. 60/124,708 filed Mar. 15, 1999. 

FIELD OF THE INVENTION 

The present invention relates to the field of echo 
cancellers, and more particularly, to a multi-adaptation echo 
canceller. The present invention is also directed to a novel 
"method and apparatus for canceling echoes associated with 
voice packet networks. 

BACKGROUND OF THE INVENTION 

In Time Division Multiplex (TDM) based telephone 
systems, echoes are generated when a sender's speech is 
reflected back to the sender from a hybrid interface. FIG. 1 
illustrates a conventional hybrid interface 4 between a two 
wire line that connects a near end subscriber 2 to the hybrid 
interface 4 and a four wire line (not shown) used for 
transmission of analog or digital signals within a local 
exchange. Any speech at point 6 from a far end subscriber 
that re -emerges at point 8 via the hybrid interface 4 is an 
echo. Speech at point 8 is transmitted to the far end 
subscriber and may include speech from the near end 
subscriber 2 and echoes generated by the hybrid interface 4. 
Such echoes may be annoying, and under certain conditions 
can completely disrupt a conversation between the far end 
subscriber and the near end subscriber. To attenuate such 
echoes, digital filters are generally used in TDM networks. 

Conventional echo cancellation techniques that arc cur- 
rently available are improvements in this field, but they do 
not eliminate the echoes completely. One conventional 
method uses a transformer with a number of passive ele- 
ments. Echoes occur in telecommunication networks due to 
impedance mismatches at hybrid transformers that couple 
the two wire line to the four wire line. Ideally, the hybrid 
transformer transmits the far end subscriber's speech signals 
at the four wire receive port through to the two wire transmit 
port without leakage into the four wire transmit port. 
However, this would require exact knowledge of the imped- 
ance seen at the two wire port, which varies widely and can 
only be estimated. As a result, leakage signals in the form of 
echoes are transmitted to the far end subscriber. 

Another conventional method, an adaptive digital echo 
canceller, will now be described with reference to FIG. 2. 
FIG. 2 is a block diagram illustrating an arrangement having 
a conventional adaptive echo canceller. In general, adaptive 
digital filters are used to replicate different impulse 
responses associated with telephones and to compensate for 
variations in the impulse responses caused by changes in the 
subscriber loops. 

Digital speech signals at point 32 from the far end 
subscriber are converted to analog signals by a digital to 
analog converter 22 before the signals reach the hybrid 
interface 4. Speech signals from the near end subscriber 2 
and/or echo signals generated by the hybrid interface 4 are 
converted to digital signals by an analog to digital converter 
24. The digital signals are then iraasmitted from the A/D 
converter 24 to a transmission delay 26, which is caused by 
other iraasmLssion devices. The digital speech signals from 
the near end subscriber 2 and/or the echo signals generated 
by hybrid interface 4 arc then transmitted to a subtractor 30. 

Simultaneously, an adaptive filter 28, i.e. Least Means 
Squares (LMS) digital filler 28, receives the far subscriber 
speech signals from point 32. Filters using other algorithms 



such as Recursive Least Squares (RLS) may be substituted 
for the LMS digital filter. The adaptive filter 28 generates 
synthetic echo signals based on the real speech signals. The 
synthetic echo signals arc then subtracted from the real echo 
5 signals using the subtractor 30, and thus reducing the echo 
signals transmitted to the far end subscriber. Subtracting the 
synthetic echo signals from the real echo signals will reduce 
the amplitude of the echo, and the remaining signal after this 
subtraction is called the "error signal." The error signal is 
30 forwarded to the filter 28 where the coefficients of the filter 
28 are adjusted accordingly in an effort to minimize future 
echoes from being transmitted to the far end subscriber. The 
magnitude of the echo and the lime delay constitute the echo 
path transfer function. 
15 Adaptive methods such as that described above generally 
rely on information contained in the speech signals from the 
far end subscriber. Echo cancellers have been used for TDM 
networks, which generally uses a sample by sample adap- 
tation. Adaptive filters generally include coefficients that are 
20 adjusted based on the LMS or RLS algorithm. The coeffi- 
cients arc updated in a manner opposite to the gradient of the 
error signal multiplied by a constant number. This constant 
number is usually referred as an adaptation step size, which 
effectively controls adaptation speed. However, a large 
25 adaptation step size results in stability problems, which 
sometimes goes into signing or oscillation situations. Such 
problems get worse when the step size is increased exces- 
sively in order to increase adaptation speed. 

There are generally two key performance parameters of 
30 an echo canceller: ERLE (echo return loss enhancement) 
and converge time. Hie ERLE is the degree to which the 
echo canceller suppresses the echo signal, i.e. the ratio 
between the echo signal and the error signal measure in dR. 
The converge time is the time required to reach the ERLE of 
-* 1 26 dB or greater. Currently, the ITU requirement for the 
converge time is less than 500 ms for a continuous excitation 
and less than 1 second for speech like bursts. 

With the advent of voice packet technology, i.e. Voice 
over Internet Protocol (VoIP), echo cancellers are becoming 
increasingly important. This is because the latency and 
transmission delay of an IP network is much longer than a 
traditional TDM network. Because the human's perception 
of an echo signal is proportional to the delay it experiences, 
controlling its ERLE and converge time performance is 
highly desirable for a good voice quality. 

There are differences in the design of echo cancellers for 
TDM and IP networks. For example, the echo canceller in 
the TDM network basically uses a sample by sample 
process, whereas an echo canceller for an IP network would 
use a block by block or packet by packet process. 

Thus, there is a need for a system and method that can 
provide echo cancellation by way of a block by block or 
packet by packet process and can increase converge speed 
55 while maintaining robust stability. 

SUMMARY OF THE INVENTION 

It is an object of the present invention to provide an echo 
canceller that can provide stability and fast converge time. 
60 It is another object of the present invention to provide an 
echo c anceller that can provide a multi-adaptation, compare, 
and s elect process for canceling echoes;" 

It Is yet another object of the present invention to provide 
an echo c anceller having multiple adaptive filters, and thus 
65 allowing the system to choose tnc optima I nq cr. 

It is yet another object of the present invention to provide 
an echo canceller thai can cancel echoes in an IP network. 
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It is yei another object of the present invention to provide 
an echo canceller that can cancel echoes using a block by 
block or packet by packet process. 

These and other objects of the present invention arc 
obtained by providing a multi-adaptation echo canceller that 5 
transfers coefficients from a first echo canceller to a second 
echo canceller as the initial coefficients for the second echo 
canceller. The system c ompares the single and double adap- 
tation echo eilTH signals and selects the echo canceller with 
hne smaller error signal. This process is repeated so that the 10 
«^>MUH cat! select (h e optimal edio cancetter tor canceli ng 
^cnoes as sociated witn each voice block/packet. T he present 
int'enuon can be implemented with N number of adaptation 
echo cancellers so long as the each block or packet can be^ 
processed within a speeilieil lime Irame. r 
^ — — 
BRIEF DESCRIPTION OF THE DRAWINGS 

These and other objects and advantages of the present 
invention will become apparent and more readily appreci- 
ated from the following detailed description of the presently 
preferred exemplary embodiment of the invention taken in 
conjunction with the accompanying drawings, of which: 

FIG. 1 illustrates a simplified block diagram showing a 
conventional two to four wire hybrid interface connecting a ?5 
near end subscriber to a far end subscriber; 

FIG. 2 illustrates a block diagram of an arrangement 
having a conventional adaptive echo canceller; 

FIG. 3 illuslrales an implementation of an echo canceller ^ 
in accordance with the preferred embodiment of the present 30 
invention; 

FIG. 4 is a block diagram of an echo canceller in 
accordance with the preferred embodiment of the present 
invention; 

FIGS. 5A and 5B illustrate flowcharts of a method for 
canceling echoes in accordance with the preferred embodi- 
ment of the present invention; and 

FIG. 6 illustrates an implementation of an echo canceller 
in accordance with another preferred embodiment of the 
present invention. 

DETAILED DESCRIPTION OF THE 
PREFERRED EMBODIMENTS 

The preferred embodiment of the present invention will 
now be described with reference to FIGS. 3-6. The present 
invention is directed to a multi-adaptation echo canceller 
using a block by block or packet by packet process. The 
present invention is primarily adapted for canceling echoes 
associated with IP networks. However, one skilled in the art 
will appreciate that various substitutions and modifications 
can be made to the examples described herein while remain- 
ing within the spirit and scope of the present invention. For 
example, although a double adaptation echo canceller sys- 
tem and method is presented herein, a derivative architecture 55 
using N number of adaptation echo cancellers should be 
appreciated. 

FIG. 3 illustrates an implementation of an echo canceller 
in accordance with the preferred embodiment of the present 
invention. The present invention provides echo cancellation 60 
for IP networks, 11.323 networks, or the like, using a block 
by block or packet by packet process. As illustrated, IP 
network 50 communicates with an interface 52 that strips the 
header information associated with the packets. Next, a 
conventional decoder 54 such as a PCM decoder is used to 65 
decode the stripped packets before transmitting them to the 
echo canceller (EC) 56 of the present invention. The decoder 
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54 also is used to convert the compressed signals to linear- 
coded signals. The EC 56 of the present invention is thus 
exposed to only digital data and does not communicate 
directly with the IP network 50. 

Simultaneously, the stripped packet data arc transmitted 
to a hybrid 70. First, the packets are de-packeted so as to 
generate a constant stream of data. Then, the digital signals 
associated with the packets are converted into analog signals 
via D/A converter 58. As described earlier, echoes are 
created when the analog signals reach the hybrid 70. Echo 
signals and speech signals from a near end subscriber 72 are 
converted into digital signals via A/D converter 72. These 
signals are packeted into its respective network required 
length before they are transmitted to the EC 56. Once the 
echo signals in each packet are cancelled using the EC 56, 
each packet is then encoded to a compressed form via an 
encoder (5X) 74. Each packet is then encapsulated by the 
interface 52. 

FIG. 4 illustrates a detailed block diagram of the EC 56 
in accordance with the preferred embodiment of the present 
invention. Additionally, FIGS. 5A and 5B illustrate flow- 
ch arts of a method for canceling, echoes using the EC 5 67 
Reference numbers in FIGS. 4, 5A, and Sli are used in the 
following discussion to describe the system and method of 
the present invention. 

The EC 56 of tt]e present invention may be implemented 
wim software or hardware, or both. A far end voice packe t 
Xfr twherc kjs.lhe time stamp) originating from a far end 
sub^L'UDet I s transm itted to a 1 to 4 wire hybrid interface 108 
via a transrnissionj deiay lOft, w hich is caused by other 



' transmis sion devict s. Echo signals Y t are generated from 
Ihe hybrid interface 10_8, navmg an e cho amplitude relat ing 
to the loop termination . For instance, a short circuit will have 
no echo back and no loop termination (open circuit), and 
thus will generate a maximum echo. A loop termination of 
about 900-ohm will generate an echo somewhere between 0 
and maximum echo. Another transmission delay 110 
receives the echoes from the hybrid interface 108. In most 
cases, the round trip delay is between 0 to 32 ms, but 
sometimes may be longer. In the preferred embodiment of 
the present invention, the round trip delay of about 32 ms is 
assumed and considered a part of the echo path function. 
Simultaneously, the far.end voice packet X k is t ransmitte 
; to a pa cket buffer 100 that p referably Fas a length of 8 0 
words (l byte is equivalent to 1 word) in st ep 200. The 
p acket nutter 10 0, which pre ferably fills up with new data 
-every It) ms, butters the^packet in step 202. The voice 
signals tTflrfl the punerea packet are transmit ted to a first 
^ada ptive filter EC1 102 and a second adaptive filter EC 2 104 
m ste p 204. As descnbecTin more detail hereinafter, the voice 
^si gnals transmitted to EC2 104 will not be processed unt il 
the LIT l\)2 has completed its operation cycle. Fhus. a delay 
mechanism, as known in the art, is use d inthe EC2 104 to 
delay the processing of the voice signals: 

EC1 102 will first process the voice signal s received from 
the packet buffer 100 . In st ep 206, the voice signals tran s- 
mitted from the packet buffer 1UU are convoluted with the 
initial 256 coefficients stored in the EC1 102 to generate first 
synthetic echo signals using the following formula: 

f^-ST^/V 0 to 255; 

where h - are coefficients of the filter. 

Th e first synthetic echo signals ¥ k (iy are th en su btracted 
from the real echo signa ls Y/° via a first subtracter 116 to 
generate the lirst erro r signals e^ <3) in step"20Sr iu M\) 210 , 
the first eiuU slgflaUt e^ are low filtered by a nrst energy 
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detecior 120 and th en transmitted to a comparato r 124. The 
energy generated fr&m first error signals e^ wui oe tem- 
porarily stored in the energy detecior 120. The error signals 

rc* U) arc also fed back to the EC1 102. 
After receiving the first error signals c Jt tl \ the 256 coef- 
ficients in the HC1 102 are updated according to the fol- 
lowing LMS algorithm in step 212: 



where k is the time, j=0 to 255, and A is the step size. 
Thereafter, the updated 256 coefficients are transmitted to 
and temporarily stored in a first coefficient storage 112. 
ITiese u pdated 256 coefficients stored in the first coeffici ent 
storage illi w ill then be copied into the fcCZ 104 in step21 4 
arid 5e Used as the initial coefficients of the EC2 104. 

On ce the operation cycle of the EC1 102 is com plete, the 
scconcT adaptive filt er ijfjj 104 begins processing tncTniff - 
ffcU Vul t'e slgUals. J he EL'2 104 will receive the identi cal 
voice Signals as tnose received by Lid 102 trom the packet 
buffer 100. 

The voice signals transmitted from the packet buffer 100 
are again convoluted with the coefficients stored in the EC2 
104 to generate second synthetic ecno signals 4/ 3 * using the 

following me in step n&. 

to 255; 

where hj are coefficients of the filter. In step 218, the second 
synthetic echo signals Y k t2) are again subtracted from the 
real echo signals Y fr t2) by a second subtractor 118 to generate 
the second error signals e fr (2) . In all cases, real echo signals 
Y^y^tsYfc'- 25 . The second error signals e A . t2) are low filtered 
by a second energy detecior 122 and transmitted to the 
comparator 124 in step 220. 'ITie error signals e A (2) are also 
fed back to EC2 104 in step 220. After receiving the second 
error signals e/ 2) , the 256 coefficients in EC2 104 are 
updated according to the following same LMS algorithm in 
step 222: 

where k is the time, j-0 to 255, and A is the step size. The 
updated 256 coefficients from the EC2 104 are then trans- 
^*>mitted to a second coefficient storage 114 in step 222. 

The c omparator 124 i n step 224 w ill co mpare the energies 
from first and second error signals c k 1 ^ anJc* 1 "^ Preferably, 
me compar ator 1Z4 will use a 10 ms short term avera ging 
methocrjjy comparing the two error signals, the comparator 
124 can det e r mine Which adaptive flliei, EC1 1 0 2 oi Et T2 
c iu 4,~has yieiciea sman er error signals (energy;, ana pre f er- 
V^_^apiy assign an output ' l to signity that the erro r signals c k {1) 
are greater than error signal e^. Thus, the comparator 124 
can determin e which adaptive filters, ECT 10 2 or EC2 104, 
perfo rmed better a nd select the voice packet 'corresponding 
to J he smaller error signals as us ouiput In step 1 226. The 
com parator 1 24 can a^itionally -selecl_th&^5 6 coefficients 
*l'rom e ither ltie~first coeliicient^sloi'a|e 1^2 6r the second 
' liOefficieni storage 114 that corresponds to me ad aptive filter 
ih^fj^r|Vsrm cd hciicr and transmit the selected coefficients 
i nto the EC1 1 02 for p r ocessing the next p acket in step 228. 

Preferably, the above-described steps 200 ihrougn zl8 of 
FI GS. 5A an d 5B are r epeated every 10 ms for each block 
jar packet. K*or every 10 ms, there will be 80 words t rans- 
" milled Jo the ECT 11)2 and EL'2 10J L ' 

At the converge point, the coefficient adaptation will be 
very small and remains that way until the echo path changes. 
As can be appreciated, the advantage of the present inven- 
ts not increasing the step size as a way to speed up the 
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converge time. A dou ble adaptation echo canceller compares 
t he performance Of Ihe two adaptive fitte rs and selects the 
one w ith better performanc e. Hi rough this process, a robust 
Stab ility and last converge time c an be achiev ed. 

Further, b oth adaptive niters ELI 102 and hc2 104 in the ' 
above example are preferabl y LMS base d a daptive digital 
fihers hu t other fillers asi n g Hi lie re n Fa 1 gonlh ms such as 
klS may betrsed iu ai'COlllauui with the ptestJUt fn"v*ention . 
~~ As stated earlier herein, alt hough a double adaptation 
system, a "d met hod has been described in detail, one skilled " 
in the art can appreciated that more than two adaptation 
filters, echo cancellers, can be implemented in accordance" 
1 with jhe present invention. Fo r example, once the o peration" 
e vele of the EC2 104 is complete, a third echo canceller c all 
begin pr ocessing the buffered voice signals. The thirdjehe- 
5 "canceller wouici receive tnc buffered voice signals from the 
packet buffer 100 and generate third synthetic echo signals 
based on the updated coefficients that are stored in the 
second coefficient storage 114. Ilie real echo signals would 
be subtracted from the third synthetic echo signals, thereby 
:o generating third error signals. The coefficients of the third 
echo canceller would be updated based on the third error 
signals and stored in a third coefficient storage. A compara- 
Tor compares the first, second, and thir d error sT§HJls Jiid - 
dUu mints WhlCh ech6 OatH'cllcr" fterlormed the Pest Alter 
. 5 - thr rnmu aiaim d^ ^rrWtn^u mni^fl ^ cno ca nceller performed 
^ the fa$l, ifie output signals are selected and tne upd ated 
coetnei ents trom the optimal echo canceller are inputted i nto 
' thu.fll^l e clio canceller lop prnce^ mgtkp np^t p^eppt Ac^an 
lie appreciated, this process can be implemented with N 
number of echo cancellers. Such implementation with N 
number of echo cancellers is illustrated in FIG. 6. 

Although various preferred embodiments of the present 
invention have been disclosed for illustrative purposes, 
those skilled in the art will appreciate that various 
modifications, additions and/or substitutions are possible 
without departing from the scope and spirit of the present 
invention as disclosed in the claims. 
I claim: 

1. A method for canceling a real echo associated with a 
^ voice packet, the method comprising the steps ofr". 

(a) providing the voice packet to a first echo canceller and' 
a second echo canceller; 

(b) generating a fi rst synthetic echo and a s econd synthe tic 
echo ; 

45 (c) subtracting th e first synthetic ccho_from the re al echo 
to ge nerate a Tirst error signa l, an d subtracting Trie 
second synthet ic echo from the real echo to gen erate a 
second error signal; and 
(d) determining whether the first echo canceller per- 

50 formed better than the second echo canceller based on 
the first error signal and the second error signal. 

2. A method according to claim 1 further comprising the 
step of buffering the voice packet before performing step (a). 

3. A method according to claim 2, wherein the buffering 
55 step is performed every 10 ms. 

4. A method according to claim 1, wherein step (b) is 
performed by a first echo canceller and a second echo 
canceller, respectively. 

5. A method according to claim 1, wherein the first 
synthetic echo is generated before the second synthetic echo. 

6. A method according to claim 1, wherein the first 
synthetic echo is generated with the first set of coefficients 
from a first echo canceller. 

7. A method according to claim 6 further comprising the 
65 steps of: 

updating the first set of coefficients stored in the first echo 
canceller based on the first error signal; 
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storing the updated first set of coefficients in a first 

coefficient storage; and 
copying the updated first set of coefficients into a second 

echo canceller. 

8. A method according to claim 7 further comprising the 
steps of: 

updating the updated first set of coefficients stored in the 
second echo canceller based on the second error signal; 
and 

storing the twice updated coefficients in a second coeffi- 
cient storage. 

9. A method according to claim 1, wherein step (d) further 
comprises the step of transmitting the first error signal and 
second error signal to a comparator. 

10. A method according to claim 9, wherein the compara- 
tor determines whether the first echo canceller performed 
better than the second echo canceller. 

11. A method for canceling a real echo associated with a 
voice packet, the method comprising the steps of: 

(a) providing the voice packet to N echo cancellers; 

(b) generating N synthetic echoes; 

(c) subtracting each of the N synthetic echoes from the 
real echo to generate N error signals; and 

(d) determining which of the N echo cancellers performed 
the best based on the N error signals. 

12. A method according to claim 11, wherein N comprises 

2. 

13. A method according to claim 11, wherein N comprises 

3. 

14. A method according to claim 11, wherein N comprises 

4. 

15. A system for canceling an echo, comprising: 

a first echo canceller and a second echo canceller adapted 
to generate a first synthetic echo signal and a second 
synthetic echo signal, respectively, based on a voice 
packet data; 

a first subtractor adapted to subtract the first synthetic 
echo signal from a real echo signal to generate a first 
error signal; 
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a second subtractor adapted to subtract the second syn- 
thetic echo signal from the real echo signal to generate 
a second error signal; and 

a comparator adapted to compare the first error signal 
with the second error signal for determining whether 
the first echo canceller performed better than the sec- 
ond echo canceller. 

16. A system according to claim 15, wherein the voice 
3Q packet data is received by the first echo canceller and the 

second echo canceller from an IP network. 

17. A system according to claim 15 further comprising a 
packet buffer adapted to buffer the voice packet data before 
the first echo canceller and the second echo canceller receive 

15 the voice packet data. 

18. A system according to claim 17, wherein the packet 
buffer receives new voice packet data every, 10 ms. 

19. A system according to claim 15, wherein the first echo 
canceller and the second echo canceller each includes an 

20 LMS adaptive digital filter. 

20. A system according to claim 15, wherein the first echo 
canceller and the second echo canceller each includes an 
RLS adaptive digital filter. 

21. A system for canceling an echo, comprising: 

N echo cancellers adapted to generate N synthetic echo 
signals based on a voice packet data; 

N subtractors adapted to generate N error signals, wherein 
each subtractor is adapted to subtract one of N synthetic 
30 echo signals from a real echo signal; and 

a comparator adapted to compare the N error signals for 
determining which one of the N echo cancellers per- 
formed the best. 

22. A system according to claim 21, wherein N comprises 

35 2- 

23. A system according to claim 21, wherein N comprises 

3. 

24. A system according to claim 21, wherein N comprises 

4. 

***** 
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